Chunking #187

shroffk · 2025-10-07T14:39:50Z

Break very large index and update request into smaller chunks
Address the issue #186

jacomago · 2025-10-08T12:06:24Z

Oh, you're ahead of me #188

shroffk · 2025-10-08T13:35:01Z

An anomaly for sure...
I did make the process parallel too, it might be overkill.

jacomago · 2025-10-08T14:29:26Z

An anomaly for sure... I did make the process parallel too, it might be overkill.

Yes, not sure about the parallelization. Nor using skip.

I think some combination of the two would be good. I forgot to use @value for instance

shroffk · 2025-10-08T19:25:54Z

Well I was trying to find the simplest way to not make copies and also support multi threading
What is your primary concern with the skip

jacomago · 2025-10-09T10:29:53Z

Well I was trying to find the simplest way to not make copies and also support multi threading What is your primary concern with the skip

I just think then you have the extra loop. Wheras mine just does on loop still. But I'm not sure if it matters much.

shroffk · 2025-10-09T13:30:22Z

well I am hoping that the multi threaded bit means that when you have large number of channels being indexed
( like populating for performance testing, or like in ESS's case where the whole CF in wiped out an recreated ) then doing it in chunks sequentially might have some performance issues.

shroffk · 2025-10-13T18:32:45Z

@tynanford do you want to share your opinion

tynanford · 2025-10-14T00:24:35Z

tagging @conorschofield as well

should index.chunk.size and processors.chunking.size have the same default of 10K?

I don't know enough java to have an opinion on skip or how to structure the loops. We also do the same as ESS and re-populate the entire CF instance every so often. All the CF data is stored in IOCs or in a matlab script which adds MML meta-data to CF . So parallelization sounds good to me.

jacomago · 2025-10-14T10:33:27Z

I'm wondering if some of the tests are breaking because there is no longer a consistent ordering of the returned channels.

jacomago · 2025-10-14T10:34:47Z

should index.chunk.size and processors.chunking.size have the same default of 10K?

I based it on the default elastic window size. 10 000 seems to be fine most of the time anyway. We hit the limit at a 170 000 IOC, I calculated that 110 000 is around where the problem is so I think 10 000 is a good default.

github-actions · 2025-10-14T11:42:52Z

Overall Project	1.21% `-2.93%`	❌
Files changed	0%	❌

File	Coverage
ChannelProcessorService.java	0% `-33.98%`	❌
ChannelRepository.java	0% `-25.33%`	❌

github-actions · 2025-10-14T11:43:31Z

Overall Project	1.21% `-2.95%`	❌
Files changed	0%	❌

File	Coverage
ChannelProcessorService.java	0% `-33.98%`	❌
ChannelRepository.java	0% `-25.5%`	❌

shroffk · 2025-10-14T13:49:22Z

I'm wondering if some of the tests are breaking because there is no longer a consistent ordering of the returned channels.

I don't think that is the case with the manual IT tests.

Maybe we can have one preference for both/all the chunking operations.

src/main/java/org/phoebus/channelfinder/ChannelRepository.java

Rename to repository.chunk.size

sonarqubecloud · 2025-10-15T11:13:11Z

Quality Gate failed

Failed conditions
C Reliability Rating on New Code (required ≥ A)

See analysis details on SonarQube Cloud

Catch issues before they fail your Quality Gate with our IDE extension SonarQube for IDE

github-actions · 2025-10-15T11:24:28Z

Overall Project	1.21% `-2.96%`	❌
Files changed	0%	❌

File	Coverage
ChannelProcessorService.java	0% `-33.98%`	❌
ChannelRepository.java	0% `-25.58%`	❌

shroffk requested a review from jacomago October 7, 2025 14:39

shroffk mentioned this pull request Oct 7, 2025

Chunk the calls to ChannelFinder ChannelFinder/recsync#119

Merged

shroffk mentioned this pull request Oct 9, 2025

Adds chunking to the processing #189

Merged

shroffk closed this Oct 9, 2025

shroffk reopened this Oct 9, 2025

shroffk and others added 3 commits October 14, 2025 12:40

Break requests to index large # of channels into chunks #186

a877131

Add Chunking to save(update) requests too #186

536357e

Add chunking to processing

1db740b

jacomago force-pushed the chunking branch from 22a9455 to 5e7bb50 Compare October 14, 2025 11:31

Use sets with chunking to avoid saving same channel

9b72400

jacomago force-pushed the chunking branch from 5e7bb50 to 9b72400 Compare October 14, 2025 11:31

jacomago requested review from anderslindho, domonkos-ess, georgweiss, imretoth-ess, simon-ess and tynanford October 14, 2025 12:59

georgweiss reviewed Oct 14, 2025

View reviewed changes

src/main/java/org/phoebus/channelfinder/ChannelRepository.java Outdated Show resolved Hide resolved

tynanford reviewed Oct 14, 2025

View reviewed changes

src/main/java/org/phoebus/channelfinder/ChannelRepository.java Outdated Show resolved Hide resolved

jacomago added 2 commits October 15, 2025 13:10

Update default chunksize

e713d77

Rename to repository.chunk.size

Add a long timeout

9c30a77

shroffk merged commit 1c7dcfb into master Oct 16, 2025
6 of 7 checks passed

jacomago deleted the chunking branch October 17, 2025 06:23

Chunking #187

Chunking #187

Uh oh!

Conversation

shroffk commented Oct 7, 2025

Uh oh!

jacomago commented Oct 8, 2025

Uh oh!

shroffk commented Oct 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jacomago commented Oct 8, 2025

Uh oh!

shroffk commented Oct 8, 2025

Uh oh!

jacomago commented Oct 9, 2025

Uh oh!

shroffk commented Oct 9, 2025

Uh oh!

shroffk commented Oct 13, 2025

Uh oh!

tynanford commented Oct 14, 2025

Uh oh!

jacomago commented Oct 14, 2025

Uh oh!

jacomago commented Oct 14, 2025

Uh oh!

github-actions bot commented Oct 14, 2025

Uh oh!

github-actions bot commented Oct 14, 2025

Uh oh!

shroffk commented Oct 14, 2025

Uh oh!

Uh oh!

Uh oh!

sonarqubecloud bot commented Oct 15, 2025

Quality Gate failed

Uh oh!

github-actions bot commented Oct 15, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

shroffk commented Oct 8, 2025 •

edited

Loading